Information content and word frequency in natural language: Word length matters

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Information content and word frequency in natural language: word length matters.

For centuries, scientists have attempted to uncover commonalities that underlie the structure of human languages (1). In a recent issue of PNAS, Piantadosi et al. (2) reported an exciting finding with respect to one unique type of language universal. The authors empirically demonstrated that word length strongly correlated with information content across 11 distinct natural languages. This find...

متن کامل

Word Length Andword Frequency

Since the appearance of Zipf’s works, (esp. Zipf 1932, 1935), his hypothesis “that the magnitude of words tends, on the whole, to stand in an inverse (not necessarily proportionate) relationship to the number of occurrences” (1935: 25) has been generally accepted. Zipf illustrated the relation between word length and frequency of word occurrence using German data, namely the frequency dictionar...

متن کامل

Word-length entropies and correlations of natural language written texts

We study the frequency distributions and correlations of the word lengths of ten European languages. Our findings indicate that a) the word-length distribution of short words quantified by the mean value and the entropy distinguishes the Uralic (Finnish) corpus from the others, b) the tails at long words, manifested in the high-order moments of the distributions, differentiate the Germanic lang...

متن کامل

Information content versus word length in natural language: A reply to Ferrer-i-Cancho and Moscoso del Prado

Recently, Ferrer i Cancho and Moscoso del Prado Mart́ın (2011) argued that an observed linear relationship between word length and average surprisal (Piantadosi, Tily, & Gibson, 2011) is not evidence for communicative efficiency in human language. We argue that their study of a random typing model is largely irrelevant to human language: their model critically rests on incorrect assumptions abou...

متن کامل

Word-Forming Process in Azeri Turkish Language

The subject intended to study the general methods of natural word-forming in Azeri Turkish language. This study aimed to reach this purpose by analyzing the construction of compound Azeri Turkish words. Same’ei (2016) did a comprehensive study on word-forming process in Farsi, which was the inspiration source of this study for Azeri Turkish language word-forming. Numerous scholars had done vari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the National Academy of Sciences

سال: 2011

ISSN: 0027-8424,1091-6490

DOI: 10.1073/pnas.1103035108